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(57) A series of action samples comprising combi- 
nations of input and output variables is obtained by 
operating or controlling the object. By giving attention to 
the input variables of the action samples, a goal case 
defined by using a first input variable, a final input varia- 
ble, a maximum value and a minimum value thereof is 
formed for every input variable. By giving attention to the 
output variables of the action samples, a range of input 
variables is picked up in which the output variables 
maintain identity, and a strategy case by which the 



range of input variables is coupled to the output varia- 
bles is formed for each of the ranges. A strategy case 
having a range of input variables without satisfying the 
condition of the goal case is finked to the goal case. An 
action case having an input variable contained in the 
range of input variables of the strategy case is linked to 
the strategy case. There is thus formed a model hierar- 
chy structure constituted by goal cases, strategy cases 
and action cases that are linked to each other. 
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Description 

Technical Field 

s This Invention relates to a reasoning system and method for building a model using historical data obtained when 

a human being operates or controls an object, particularly a model having a hierarchy structure constructed from mutu- 
ally associated goals, strategies and actions. The invention relates further to a system and method lor automatically 
operating or controlling an object using the model constructed. 

The system and method according to the present invention finds use in factory automation, particularly real-time 

10 dynamic control operations in manufacturing and transportation facilities. 

Background Art 

The goal of factory automation is to improve production efficiency by developing electromechanical devices and 
is computer algorithms in order to process sensor signals and control actuators. A number of such devices and algorithms 
have relieved human beings from tedious, intensive, repetitive and often dangerous tasks. Nevertheless, human beings 
still engage in tasks that are desired to be automated in manufacturing and other industrial environments. Automation 
should be expedited in order to improve productivity, reduce cost, reduce waste, improve quality, improve labor safety 
and reduce training cost. 

20 In a society based upon competitive manufacturing, where a wider variety of products are produced at a faster rate 
with better quality, more sophisticated means for factory automation are necessary. One of such means is an apparatus 
for building an automation mode! for human operation. 

In the field of artificial intelligence (Al), which concerns human knowledge, there are many approaches with regard 
to modeling, namely how to build a model of such intelligence. 

25 The most common approach is an expert system. An expert system expresses expert knowledge, which has been 
acquired through interviews, as a group of rules. The rules of one group are processed by a logical inference engine to 
produce explanations, predictions or suggestions. The logical inference mechanism involves a variety of methods inclu- 
sive of fuzzy logic. In general, rules represent knowledge at a conceptual or situational level. 

An approach which is more data oriented is referred to as "case-based reasoning" This is a learning strategy in 

30 which cases are stored in memory and compared. More specifically, case-based reasoning interprets new situations by 
comparing and contrasting them with previous similar situations stored in memory. It recognizes new situations, makes 
new inferences and asks questions when inferences cannot be made. In general, cases express knowledge at an oper- 
ational level. 



35 Disclosure of the Invention 

An object of the present invention is to provide a new approach broadened to cover both a rule-based approach 
and the case-based approach. 

According to the present invention, the following action principle is fundamental to modeling the operation or control 
40 of an object by a human being and reproducing human operation or control by machine in accordance with the created 
model: "Some strategy exists for an action, and some goal exists for a strategy". 

According to the present invention, a record of actual operation or control performed by a human being is adopted 
as an information source. 8y extracting strategies from the record and deducing goals from the strategies, a model of 
a hierarchy structure comprising goals, strategies and actions Is constructed. 
46 A reasoning method according to the present invention is as follows when defined in its most genera! form: 

Specifically, the reasoning method according to the present invention comprises steps of accepting a series of 
action data each comprising a set of input and output variable values obtained by operating or controlling an object; cre- 
ating, for each input variable, in view of the input variable values in the accepted action data, a goal of the operation or 
control; creating, for each output variable, in view of the output variable values in the accepted action data, a strategy 
so for attaining the goal; associating the goals with strategies which have not yet attained the goals; and associating the 
strategies with action data which implement them; thereby building a model of a hierarchy structure constructed from 
mutually associated goals, strategies and actions. 

Conventional case-based reasoning stops at the experience level, namely at tie action cases, and does not prom- 
ise any possibilities beyond this. Suitable solutions are not obtained when cases lor which there is no past experience 
55 arise. 

According to the present invention, goals are created from actual action samples, and strategies for achieving these 
goals are created. The goals and strategies are then linked together. Accordingly, the aim ( goal) is determined at the 
conceptual level for the sake of operation or control, and decision making conforming to the situation is carried out by 
means of tie strategy associated with the goal. Finally, concrete decision making (the deciding of output values) is per- 
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formed by the action cases that are associated with the strategies. 

in rule-based reasoning which Includes fuzzy rules, it is necessary to interview experts and codify the knowledge 
acquired into rules in order to create the rules. According to the present invention, goals and strategies are created 
automatically from a record of past operations. 
5 The reasoning method of the present invention when expressed in more concrete form is defined as follows: 

Specifically, the reasoning method according to the present invention comprises steps of accepting a series of 
action samples each comprising a set of Input and output variable values obtained by operating or controlling an object; 
creating, for each input variable, a goal case stipulated using an initial value, a terminal value, a maximum value and a 
minimum value of the input variable value in the accepted action samples; extracting zones of input variables in which 
w the output variable value in the accepted action samples remains fixed, and creating a strategy case, with regard to 
each zone, which links the zone of input variables to the output variable value; associating a strategy case, which pos- 
sesses a zone of input variables that does not satisfy a condition of a goal case, with the goal case; and associating an 
action case, which possesses input variable values that fall within the zone of input variables in a strategy case, with the 
strategy case; thereby building a model of a hierarchy structure constructed from mutually associated goal cases, strat- 
us egy cases and action cases. 

In a preferred embodiment, the accepted action samples are converted to binary data of prescribed bits. 

In another preferred embodiment, the step of creating a goal case includes a step of creating a provisional goal 
case, a step of outputting the provisional goal case and a step of adopting a goal case, after it has been evaluated by 
a user, as a final goal case. 
20 The evaluation performed by the user includes adding, deleting or modifying goal cases. 

More specifically, the step of creating strategy cases includes a step of creating strategy samples by successively 
comparing neighboring output variable values arrayed in a time sequence, and storing, when a change has. occurred, 
the output variable values immediately before and immediately after the change, a step of converting the strategy sam- 
ples to strategy cases, which possess a "do * * * when * * * until * * * min * * * max" format, by referring to the action 
25 samples, and a step of optimizing the strategy cases after conversion. 

An example of the step for optimizing the strategy cases is to reduce the number of two or more strategy cases hav- 
ing identical output variable values. 

In concrete terms, the linking of strategy cases and goal cases is implemented in memory by pointers to the goal 
cases provided for each of the strategy cases and pointers to the strategy cases provided for each of the goal cases. 
so In concrete terms, the linking of action cases and strategy cases is implemented in memory by pointers to the strat- 
egy cases provided for each of the action cases and pointers to the action cases provided for each of the strategy 
cases. 

An action case is created by leaving one and deleting the others of action samples comprising sets of all identical 
input and output variable values. 

35 The model thus created is verified by using the above-mentioned series of action samples as input/output data. 

More specifically, verification is performed by steps of accepting the action samples one at a time in the time 
sequence thereof; finding goal cases having conditions not satisfied by the input variable values of an accepted action 
sample and listing strategy cases associated with the goal cases found; finding, from among the strategy cases listed, 
a strategy case having a condition satisfied by the accepted input variable values and listing action cases associated 

40 with the strategy case found; and, from among the action cases listed, outputting an output variable value of an action 
case having input variable values that agree with or are nearest to the accepted input variable values. This output value 
and the output variable value of the action samples is compared by the user, whereby the model is verified. 

A method for operating or controlling an object using the model created by the above-described reasoning method 
also is provided by the present invention. 

45 This method comprises the steps of successively accepting input data obtained from an object; finding goal cases 
having conditions not satisfied by the accepted input data and listing strategy cases associated with the goal cases 
found; finding, from among the strategy cases listed, a strategy case having a condition satisfied by the accepted input 
data and listing action cases associated with the strategy case found; from among the action cases listed, outputting an 
output variable value of an action case having \np\A variable values that agree with or are nearest to the accepted \npu\ 

so data; and repeating the foregoing processing whenever input data is accepted 

The present invention provides also a system which executes the above-described reasoning method and control 
method. 

A reasoning system according to the present invention comprises means for accepting and storing a series of 
action samples each comprising a set of input and output variable values obtained by operating or controlling an object; 
as means for creating, for each input variable, a goal case stipulated using an initial value, a terminal value, a maximum 
value and a minimum value of the input variable value in the accepted action samples; means for extracting zones of 
input variables in which output variable value in the accepted action samples remains fixed, and a creating strategy 
cases, with regard to each zone, which links the zone of input variables to the output variable value; means for associ- 
ating a strategy case, which possesses a zone of input variables that does not satisfy a condition of the goal case, with 
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the goal case; and means for associating an action case, which possesses input variable values that fall within the zone 
of input variables in a strategy case, with the strategy case; whereby a model of a hierarchy structure constructed from 
mutually associated goal cases, strategy cases and action cases is built 

A system for operating or controlling an object using a model created by the above-described reasoning system 

5 comprises means for successively accepting input data obtained from an abject; means for finding goal cases having 
conditions not satisfied by the accepted input data and listing, in memory, strategy cases associated with the goal cases 
found; means for finding, from among the strategy cases listed, a strategy case having a condition satisfied by tie 
accepted input data and iisting, in memory, action cases associated with the strategy case found; means for outputting, 
from among the action cases listed, an output variable value of an action case having Input variable values that agree 

10 with or are nearest to the accepted input data; and means for repeating the foregoing processing whenever trtpu/t data 
is accepted. 

Other features and advantages of the present invention will be obvious in the description of the embodiments ren- 
dered with reference to the accompanying drawings. 

15 Brief Description of the Drawings 

Fig, 1 is a block diagram showing the overall configuration of a system which includes a controlled objected in a 
modeling mode; 

Fig, 2 is a block diagram showing the overall configuration of a system which includes a controlled objected in a 
20 problem-solving mode; 

Fig. 3 is a block diagram showing an example of the construction of a system according to the present invention, in 
which use is made of hierarchy model; 

Fig. 4 is a perspective view showing a crane control system as an example of a controlled object; 
Fig. 5 illustrates another example of an arrangement for detecting angle of sway; 
25 Fig. 6 is a block diagram showing a system for simulating crane control; 
Fig- 7 illustrates variables in the simulation; 

Figs. 8a and 8b are flowcharts showing a processing procedure in a modeling mode; 
Fig. 9 illustrates an example of input and output data obtained in simulation of crane control; 
Fig, 10 illustrates binary strings obtained by the binary encoding the input and output data shown in Fig. 9; 
so Fig. 1 1 illustrates an example of a list, which is for creating goal cases, created for each input variable in simulation 
data for crane control; 

Fig. 12 illustrates an example of goat cases in crane control; 

Fig. 13 illustrates an example of strategy samples in crane control; 

Figs. 14 through 16 illustrate a simple example for describing conversion of strategy samples to strategy cases, in 
35 which Fig. 14 shows an example of action samples, Fig. 15 an example of strategy samples and Fig. 16 a graph of 
action samples; 

Fig, 17 is a graph of action samples in crane control; 

Figs. 18a and 18b show an example of strategy cases in crane control; 

Fig, 19 illustrates the binary strings of the strategy cases of Figs, 18a and 18b; 
40 Fig. 20 illustrates the linkage of each goal case to strategy cases; 

Fig. 21 illustrates the linkage of each strategy case to goal cases and action cases; 

Fig, 22 illustrates tie linkage of each action case to strategy cases; 

Fig, 23 illustrates a hierarchy structure comprising goals, strategies and actions; 

Fig. 24 is a flowchart showing a processing procedure in a problem-solving mode; and 
45 Fig, 25 shows an example of input and output data in the problem-solving mode. 

Best Mcfle,,for,,,parr,ying..Qut, the, .Invention 

(1) Two modes in one system 

50 

The system according to the present invention using a hierarchy model possesses two modes. The first is a mod- 
eling mode and the second is problem-solving mode. 

In the modeling mode, data (manipulated variables and controlled variables) obtained when a controlled object {a 
system to be controlled) is actually operated by a human being (an operator) are entered and a model having a hierar- 
55 chy structure, described below, is constructed. 

in the problem-solving mode, the controlled object is actually controlled in accordance with a model already con- 
structed. 

Fig, 1 is a block diagram showing the overall configuration of a system which includes a controlled objected in the 
modeling mode. The system according to the present invention is embodied as a modeling system (or a learning sys- 
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tern), 

in general, the controlled variables of a controlled object (system to be controlled) 10 are sensed by sensors 11 , 
12, ■ • • 1k. The sensed controlled variables (x lf x 2 , ■ * * xjj are applied to an operating unit 20. While observing the 
status of the controlled object and, when necessary, while observing the vaiues of the controlled variables displayed 

5 (e.g., while checking the position of a needle on a meter), the operator manipulates (operates) the system In such a 
manner that the controlled object will attain a desired state. A manipulated variable y {output data) (which is not limited 
to one type) outputted by the operating unit 20 is applied to an actuator 21 , which actually drives or operates the con- 
trolled object 10. There may be instances in which some or none of the controlled variables , x 2 , ♦ * • , are applied 
to the actuator 20. The controlled variables may be logical values (or computed values or inferred values), which would 

w be obtained from the controlled object 1 0. 

The controlled variables x 1 ~ detected by the sensors 11 — 1k and the manipulated variable y outputted by the 
operating unit 20 enter a modeling system 1 A. It may be so arranged that the output of the actuator 21 or a measured 
value thereof enters the modeling system 1 A as the manipulated variable. The modeling system 1 A uses the input data 
x 1 - *k and the output data y to create a mode! relating to control (or operation). 

15 Fig. 2 illustrates the overall configuration of a system which includes a controlled object in the problem-solving 
mode. The system according to the present invention is embodied as an automatic control system (automatic operating 
system) 1 B. 

The automatic modeling system 1B accepts the controlled variables x-j ~ x^ of the controlled object 10 sensed by 
the sensors 1 1 ~ 1k and calculates the manipulated variable y, which is to be applied to the controlled object 10, in 
20 accordance with the model created previously. The controlled variable y is applied to the actuator 21 that drives the con- 
trolled object 10 (or the actuator 21 is controlled in such a manner that its output will attain the calculated value). As a 
result, the controlled object 10 is controlled (operated) by the automatic control system 1 B in the same manner that the 
operator controlled (operated) the controlled object using the operating unit 20, 

25 (2) System configuration and example of application 

Fig, 3 illustrates an example of the configuration of a system using a hierarchy model according to the present 
invention embodied as the modeling system or automatic control system, depending upon the particular mode. 

This system basically is implemented by a computer 30, The computer 30 utilized can be a personal computer or 
so any other general-purpose computer irrespective of whether it is a large- or small-scale computer. 

The computer 30 is provided with a program memory 31 storing programs for executing processing in the modeling 
mode and problem-solving mode, and a data memory 32 for storing data (inclusive of data representing a model) gen- 
erated or used in these processing operations. The memories 31, 32 are implemented by a ROM, RAM, hard disk, 
floppy disk, optical disk, etc. The memories 31 and 32 may both be implemented by hard disks, the memory 31 by a 
35 ROM and hard disk or floppy disk, and the memory 32 by a RAM and hard disk. Here the memories 31 and 32 are con- 
ceptions classified by the type of data stored 

An mpu\ unit 33 is connected to the computer 30, as well as a display unit 34 and a printer 35 that serve as output 
units. The input unit 33 includes a keyboard and a mouse. The display unit 34, which displays menu screens, may also 
be considered as part of the input unit 33, At least one of the display unit 34 and printer 35 (preferably the display unit 
40 34) will suffice as the output unit 

Also connected to the computer 30 are A/D converters 36 for converting the controlled variables (usually analog 
quantities) of the controlled object 1 0 sensed by the sensors 11 - 1 k to digital data, an A/D converter 37 for converting 
the manipulated variable (if it is an analog quantity), which is outputted by the operating unit 20 in the modeling mode, 
to digital data, and a D/A converter 38 for converting the manipulated variable (usually a digital value), which is calcu- 
45 lated and applied to the actuator 21 in the problem-solving mode, to an analog quantity as necessary. These A/D con- 
verters and D/A converters can be utilized as internal converters of the computer. Further, if the A/D converters and D/A 
converters are unnecessary, they need not be provided. For example, if the actuator is one which accepts digital quan- 
tities, the D/A converter 38 is unnecessary. 

The computer 30 would be provided with interfaces (differentiating circuits and the iike r described later) as neces- 
so sary for entering controlled variables or manipulated variables and for outputting manipulated variables. 

Fig. 4 illustrates a crane control system as an example of the controlled object. For the sake of simplicity, the crane 
system is one which includes only one rail 50, 

The one rail 50 is suspended horizontally at a suitable height. A trolley 40 is supported on the rail 50 by wheels 41 
so as io be free to move. The wheels 41 are rotativefy driven by a motor 42 directly or via a reduction mechanism. 
55 The trolley 40 is further provided with a hoist. The hoist includes a pulley 44, a hoist motor 46 which drives the pul- 
ley 44, and a wire 47 wound round the pulley 44. The pulley 44 is secured to a rotary shaft 45 received freely rotatabiy 
by brackets 43 fixed to the trolley 40. The rotary shaft 45 is rotatlvely driven by a motor 46 directly or via a reduction 
mechanism. Attached to the lower end of the wire 47 is a hook 48 from which a load 49 is suspended. 

Assume the following crane control: The suspended load 49 is carried by causing the trolley 40 to run from one 
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point A to another point B on the rail 50. At this time the travel of the trolley 40 is controlled in such manner that the 
angle of sway of the wire 47 (the angle of sway in the direction in which the trolley 40 moves) will be as small as possible 
(i.e., in such a manner that the angle of sway will fail within a predetermined angular range, which shall be referred to 
as the "maximum allowable angle of sway"). 
5 The position of the trolley 40, the angle of sway and a differentiated value of the angle of sway are employed as the 

controlled variables. 

The position of the trolley 40 is taken as being in the positive direction, namely the direction toward the end point 
B, with the starting point A serving as the origin. This position is sensed by a rotational transducer (not shown) provided 
on the rotary shaft of the motor 42 or on the shaft of one of the wheels 41 . A pulse train (or digital data representing 
id position) outputted by the rotaiionai transducer enters the computer 30 (here the A/D converter 36 would be unneces- 
sary). If necessary, an origin sensor which senses that the trolley 40 is located at the starting point A may be provided. 

The angle of say of the wire 47 is sensed by a television camera 51 , which is mounted on the trolley 40 and so 
arranged as to capture an image of the wire 47, and a processing unit for applying image processing to the video signal 
from the camera 51. This image processing unit may be implemented by part of the computer 30, in which case the 
is video signal from the camera 51 would be accepted by the computer 30 via an A/D converter 36, Image processing for 
sensing the angle of a linear object with respect to a perpendicular can be performed by a well-known technique. 

The differentiated value of the angle of sway can be computed by the computer 30. 

in a case where the load is suspended using a shaft 47A instead of the wire 47, as shown in Fig. 5, the angle of 
sway is sensed more simply. The shaft 47A has its upper end secured to the shaft 45. The latter is freely rotatably sup- 

20 ported on the brackets 43. The lower end of the shaft 47A forms a hook 48. The angle of rotation of the shaft 45 is 
sensed by an angle sensor 52. If the output signal of the angle sensor 52 is an analog quantity, then this is converted 
to a digital quantity by an A/D converter 36 before it is accepted by the computer 30. Further, the output signal from the 
angle sensor 52 is differentiated by a differentiating circuit, and the differentiated signal is accepted by the computer 30, 
via an A/D converter 36, as the differentiated value of the angle of sway. The differentiation processing can be per- 

25 formed by the computer 30 as a matter of course, if the output produced by the angle sensor 52 is a digital quantity, this 
is accepted by the computer 30 as is. 

The manipulated variable is the traveling speed of the trolley 40. The traveling speed can be expressed as the volt- 
age or current applied to the motor 42 which transports the trolley. 

This system according to the invention is applicable also to a controlled object devoid of sensors which sense con- 

30 trolled variables or an actuator to which a manipulated variable is applied. One example is a system for predicting the 
price of stock, in such case, the input data would be the prevailing stock price (regarding each issue) or the prevailing 
average stock price and would be entered from a keyboard, a communication device or a floppy disk drive, etc. The out- 
put data would be a decision to sell, a decision to buy and the amount of shares involved. These items of data would 
be outputted on the display unit or printed out by the printer. 

35 Another example is a system for predicting sales at a store. Here the input data would be the day of the week, the 
weather and inventory, etc., and would be entered from a keyboard, a communication device or a floppy disk drive, etc. 
The output data would be predicted sales, amount of purchases to be made, etc., and would be outputted on the display 
unit or printed out by the printer. 

The crane control system mentioned above will be taken as an example in the description of the modeling mode 

40 and problem-solving mode set forth below in detail. The actual operating data used in the modeling mode is data 
obtained by simulation. An example of the simulating system is depicted in Fig. 6. This system is constituted by the 
computer 30, the program memory 31, the data memory 32, the input unit 33, the display unit 34 and the printer 35. 
Processing in the problem-solving mode also is executed in this system. Fig. 6 can be said to illustrate the minimum 
structure of the system according to the present invention (though it will suffice as a manner of course if only the display 

45 unit or printer is provided as the output unit). 

In the simulation, the equations for computations used to obtain the position x of the trolley and the sway angle 9 
are as follows (see Fig. 17): 

dt 2 m i rn-j m 1 VdU 

^~( mi *T 2 )-g-e--^-^ffi a »fl 

55 

where m T represents the mass of the trolley, m 2 the mass of the suspended toad, I the length of the rope, g acceleration 
due to gravity and F the thrust provided by the motor, which is obtained by the following equation: 
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F= K 1 co 1 + K 2 e> 2 + K 3 V 

where the following hold: 
5 oo 1 «V d -V 

t 

0 

Here Ki P K 2 and K 3 are constants. 

Further, V represents the motor velocity and the target velocity of the motor which is the manipulated variable 
given by the operator In the simulation. Specifically sfi takes on the following values: 

75 

V 6 = {-40 (rn/min), 0, 40 (m/min)} 

(3) Processing in the modeling mode 

20 This section and the next section M (4) Processing in the problem-solving mode" are executed mainly by the compu- 
ter 30, Accordingly, unless stated otherwise, the specific processing set forth below should be understood as being 
processing executed by the computer 30. 

The model has a hierarchy structure comprising three levels, namely a goal, a strategy and an action. This structure 
is based upon the following principle: 
25 Tor each and every action, there is at least one strategy associated with it, and for each and every strategy, there 
is at feast one goal associated with it." 

The procedure for specific processing in the modeling mode will be described with reference to Figs, Sa and 8b. 

Step 101 

30 

The operator actually manipulates (operates) the controlled object 10 using the operating unit 20, At this time the 
controlled variables (input data) obtained from the sensors 1 1 - 1 k and the manipulated variable (output data) applied 
to the actuator 21 are sampled at fixed time intervals and fed into the computer 30. These items of input and output data 
are saved in the data memory 32 as teaching data and represent a record of actual operation. 
35 Fig, 9 illustrates an example of input and output data obtained in the simulation of the above-described crane con- 
trol system. 

The input and output data is sampled every 0.3 sec. As mentioned above, the crane runs under three types of oper- 
ations, namely forward, reverse and stop. The operation output (trolley traveling speed: output data) u is set to be equal 
to +40 (m/min) in forward operation, -40 (m/min) in reverse operation and 0 (m/min) in the stopping operation. The items 

40 of input data are trolley position x (m), sway angle 8 (rad) of the wire and rate of change (differentiated value) de of the 
angle of sway. (Hereinafter, d6/dt will be represented by d8). 

In Fig. 9, a length of time indicated by T1 is the start-up time of the trolley In this time period the operation output 
u is so adjusted that the sway angle 9 falls within a range that is as small as possible. The trolley runs at a constant 
speed during the length of time indicated by T2. in order to stop the trolley exactly at the target point B, the operation 

45 output u is adjusted in during a length of time indicated by T3. A length of time T4 shows a process in which the trolley 
substantially arrives at the point B and the sway angle e of the suspended bad is gradually attenuated, 

Step 102 

60 The input/output data is binarlzed and converted so as to form binary strings. Each binary string is referred to as 
an "action sample". 

The binary strings of the input/output data illustrated in Fig. 9 are shown in Rg, 10, The items of input data x, e, d8 
and output data u are composed of eight bits each. 

In this encoding processing, minimum values are converted to 00000000 and maximum values 1 1 1 1 1 1 1 1 for input 
ss and output variables. The minimum and maximum values of position x are -1 (m) and 22 (m), respectively The minimum 
and maximum values of sway angle e are -0.1 (rad) and +0.1 (rad), respectively. The minimum and maximum differen- 
tiated values of de are -0.1 (rad/sec) and +0.1 (rad/sec), respectively The minimum and maximum values of traveling 
speed u are -40 (m/min) and +40 (m/min), respectively, in order to make Fig. 10 easier to comprehend, it should be 
added that position x- 0 (m) , sway angle 6 = 0 (rad) , sway-angie differentiated angle value de = 0 (rad/sec) and 
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traveling velocity u = 0 (m/min) are represented by the binary values 0000101 1 , 1 0000000, 1 0000000 and 10000000, 
respectively. 

Binary encoding is particularly meaningful when the processing program for the modeling mode has been written 
in a sophisticated language. The first reason for binary encoding is that the resolution of the action samples can be 
5 changed with ease. For example, by discarding the eight lower order bits of 16~bit data, a resolution of eight bits can be 
obtained. The second reason for binary encoding is that the method of representation best suited to a genetic algo- 
rithm, which is one optimization algorithm, is a binary string. 

Step 103 

10 

Sub-step 103-1 

First, an initial action sample and a terminal action sample are extracted for each and every input variable, and the 
extracted action samples are compared by finding the difference between them. Further, a maximum value and a min- 
16 imum value are calculated for each and every input variable. 

In the example shown in Fig, 9, the action sample of each input variable at time t - 0.00 is the initial action sample 
(value) of the input variable and the action sample of each input variable at time t = 29.10 is the terminal action sample 
(value) of the Input variable. These initial and terminal action samples are enclosed by the rectangles indicated by 
dashed lines. Further, the maximum and minimum values of the input variables e, de are enclosed by the rectangles 
20 indicated by solid lines. The maximum and minimum values of the input variable x are terminal and initial values 
enclosed by the rectangles indicated by the dashed lines. 

Sub-step 103-2 

25 A list of results obtained at step S103-1 is created. This list naturally is created in the data memory 32. 
A list of the results obtained with regard to the example of Fig. 9 is illustrated in Fig, 1 1, 

Sub-step 103-3 

30 If (maximum value) - (minimum value) - 0 holds, then this input variable is disregarded. This input variable is 
masked. 

The reason for masking is that an input variable whose maximum and minimum values are equal undergoes no 
change whatsoever in the operation of the controlled object; such an input variable has absolutely no influence upon 
operation or control. 

35 

Sub-step 103-4 

in a case where the difference between the initial value and the terminal value is non-zero, then goal conditions 
regarding this input variable are expressed as follows: 

40 

goal j : x t : value of "terminal state" « (terminal value) 
goal j*fi : x , : range of "transitional state" * [min, max] 

45 The fact that the initial and terminal values regarding the input variable x { differ in actual operation of a controlled 
object means that this operation must have been carried out (by the operator) with tie intention of changing the input 
variable x^ from the Initial value to the terminal value. It can be surmised that the operation must have been carried out 
with the intention of making the range of fluctuation in the input variable Xj fall between the minimum value and the max- 
imum value. Accordingly, this means that the goal (the purpose of control) with regard to the input variable x, is to even- 

so tually bring the value of the input variable xj to the terminal value starting from the initial value, at which time it is so 
arranged that the range of fluctuation in xj falls between the minimum value and the maximum value. 

In the example shown in Fig, 1 1 , the initial and terminal values differ for the input variable x. Accordingly, the follow- 
ing goal conditions are provisionally established in regard to the input variable x: 

55 Goal 1 : x : value of "terminal state" = 9.467 

Goal 2 : x : range of "transitional state" *= {0.000, 9.467] 
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Sub-step 103-5 

In a case where the difference between the initial and terminal values is zero, the goal condition regarding this input 
variable is written as follows: 

s 

goal j : x j : range of "transitional state" = [min, max] 

In a case where the initial value and terminal value regarding a certain input variable are the same, a goal cannot 
be deduced from either the initial value or the terminal value. In such case, it would be inferred that the intention was to 
to hold the \rxpu\ variable within a fixed range in the course of operation. This is the meaning of the goal conditions men- 
tioned above. 

The initial value and terminal value are the same (both are 0.000) for the sway angle 6 and differentiated value de 
thereof in the example illustrated in Fig. 11. Accordingly, the following goal conditions are provisionally established in 
regard to the input variables 6, d6: 

15 

Goal 3:9: range of "transitional state" « [-0.029, 0,027] 
Goal 4 : d9 : range of "transitional state" = [-0.0329, 0.042] 

bo Sub-step 103-6 

The results of the above-described sub-steps 1 03-3, 1 03-4 and 103-5 are printed out by the printer 35 or displayed 
on the display unit 34. 

25 Sub-step 103-7 

The above-mentioned goal conditions are encoded into binary strings in the following array: 

"Goal No. Input Variable Type Flag Value(s)" 
This array will be described in greater detail in the description of step 104, which is next. 

30 

Step 104 

The operator observes the goal hypotheses (the goals 1-4 provisionally established} printed by the printer 35 or 
displayed on the display unit 34, evaluates these hypotheses and adds new goal conditions to these, deletes any goal 
35 condition or modifies goal conditions as necessary. 

in the example of the crane control system, goal 2 in the established hypotheses (goals 1 - 4) is deleted as being 
unnecessary. The reason for this is that goal 2 relates to the range of the transitional state of the input variable x (which 
represents position). Since the position x need only be increased monotonously from the initial value to the terminal 
value, the goal hypothesis concerning the range of the transitional state is unnecessary As a result, the goal numbers 
40 of goals 3 and 4 are each moved up by one to make these goals 2 and 3, respectively. 

What are finally obtained through the. process of evaluation by the operator are goal cases, in the example given 
above, the three goal cases eventually obtained are illustrated in Fig. 12 along with their binary strings. 

The array of the items in the binary strings and the binary data will be described (see sub-step 103-7). 

The numbers 1, 2, 3 of the goals 1, 2, 3 at the head of the arrays are "goal case numbers (goal numbers)", and 
45 each is represented by two bits. 

The immediately following x t 6, de are the types of "input variable", and each is represented by two bits. Specifically, 
x is 00, 0 is 01 and do is 10, 

The third item "type flag" represents either the value of the "terminal state" or the range of the "transitional state". 
The type flag is composed of two bits, with the value of the "terminal state" being represented by 01 and the range of 
so the "transitional state" being represented by 10. 

The last H value(s)" differs depending upon the type flag. If the type flag is 01 , the "value" is the "terminal value" (rep- 
resented by eight bits). Since there is only one "terminal value", this eight-bit data is repeated twice. If the type flag is 
10, the "values" are 16-bit data arrayed in the order of the minimum value and maximum value (each of which is repre- 
sented by eight bits) (see Figs. 9, 1 0 and 11). 

55 

Step 1 05 

A strategy is extracted by comparing action samples with regard to the output variable. 

More specifically, the initial value and terminal value of the output variable are stored in the data memory 32. This 
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is followed by comparing the ouiput data (action sample) at time t and the output data at the next sampling time t+1. If 
these items of output data differ, then the item of output data at time t and the item of output data at time t+1 are stored. 
This processing is repeated while incrementing the value of t from t « 0 to t - terminal time. The items of data thus 
stored in memory are referred to as "strategy samples". 
5 The strategy samples of the crane control system will now be described in detail. Reference will be had to the 

binary strings shown in Fig. 10. 

Fig. 13 illustrates the strategy samples of the crane control system. Here Time t and Action SampSe No. also are 
shown for the sake of reference. 

The initial value 10000000 of the output variable (time t » 0.00, Sample No. = 1 ) and the terminal value 1 0000000 
10 (time t - 29-10, Sample Mo. 98) are stored in memory. 

Since the output variable value 10000000 at time t « 0.00 and the output variable value 1 1 1 1 1 11 1 at time t * 0.30 
differ, the output variable value at time t = 0.30 is stored (the output variable value at time t ~ 0,00 has already been 
stored). 

The output variable value 1 1 1 1 1 11 1 at lime t « 0.30 and the output variable value 10000000 at time t = 0.60 are 
is compared. Since these values are different from each other, the values are stored. 

Similarly the value at time t = 0.60 and the value at time t - 0.90 differ from each other, the value at time t = 0,90 
and the value at time t - 1 .20 differ from each other and so do the value at time t * 1 ,20 and the value at time t ~ 1 .50. 
These values, therefore, are stored. 

The values from that at time t = 1 .50 to time t ~ 1 4.40 are identical. Accordingly, the output variable values at these 
20 times are not stored. 

The value at time t - 14.40 and the value at timet ~ 14.70 differ. Accordingly, the output variable values at these 
times are stored. 

The value at time t = 14.70 and the value at time t = 15.00 are identical. Accordingly, the output variable values at 
these times are not stored. 

ss The value at time t - 1 5.00 and the value at time t « 1 5.30 differ from each other, the value at time t » 1 5.30 and 
the value at time t = 1 5.60 differ from each other and so do the value at time t = 1 5.60 and the value at time t = 1 5.90. 
These values, therefore, are stored, 

The vafue at time t - 1 5,90 and the value at time t * 1 6,20 are identical and, hence, these values are not stored. 
The value at time t = 1 6,20 and the value at time t - 1 6.50 differ and therefore are stored. 
30 The values of the output variables from time t - 16.50 onward are all the same until the terminal time t = 29.10 and 
therefore are not stored. 

Thus, the positions at which the output variable changes are found and the output variable values at the points in 
time immediately before and immediately after each of these positions are stored in memory. 

In the example given above, the description deals solely with a case in which the output variable is of one type. 
35 However, the procedure is the same even if ihe output variables are of a plurality of types. In such case operation is 
such that if the value of any one output variable changes, this is regarded as a change in the output variables and the 
values of all of the output variables immediately before and immediately after this change are stored. 

Step 106 

40 

The strategy samples thus extracted are converted to strategy cases, which have the following format upon refer- 
ring to the action samples: 

"do • • • when * * * until • • * min • • * max 
The value of the output variable (of one or a plurality of types) follows "do"; "when" is followed by the value (one or 
45 a plurality of values) of the input variable; and ,T untir also is followed by the value of the input variable. Further, minimum 
values regarding each of the input variables are arrayed after "min", and the maximum values regarding each of the 
input variables are arrayed after "max". 

This will be described using a simple example white referring to Figs. 1 4 through 16. In these Figures, the input var- 
iables are of two types, namely x 1 , x 2> and the output variable is of one type, namely u. Time is indicated at tp ~ tp+n . 
so Fig. 14 shows a simple example of action samples. Strategy samples created on the basis of the action samples in 
Fig. 1 4 are illustrated in Fig. 1 5. With regard to the output variable in Fig. 1 4, the acquisition of the strategy samples of 
Fig. 15 by the extraction of the values of the output variable at the points in time immediately before and after any 
change in the output variable is as described above in step 105. 

Fig. 16 is a graph of the action samples in the input-variable space. The input variables x 1p x 2 are plotted along 
55 respective ones of two orthogonal axes. Zones within which the output variable u maintains a fixed value are indicated 
by the rectangles. 

In three-dimensional input-variable space formed when there are three types of input variables, a zone in which the 
output variable indicates a fixed value is represented by a rectangular parallelepiped. In four-dimensional input-variable 
space, the zone is represented by a hyperbox. In the description that follows, the term Tiyperbox 1 ' is used with regard 
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all cases, inclusive of the rectangle and rectangular parallelepiped, irrespective of whether the input-variable space is 
two-dimensional or three-dimensional. 

in Fig. 16, a hyperbox Q is a zone within which the output variable u is 90, and a hyperbox Q+1 is a zone within 
which the output variable u is 100. 
5 The set of input and output variables defining one hyperbox is referred to as a strategy case. If the strategy case Q 

is taken as an example, this strategy case is expressed as follows: 
do u = 90 



when x 1 - 3.0, x 2 - 20 

io until x-} « 4,5, x 2 s= 45 

rnin x 1 ■ 3,0, x 2 » 10 

max x-j = 5.5, x 2 = 45 



The meaning of this Is as follows: 
is 'When the inputs are = 3, x 2 = 20, make the output u 90 until Xi - 4.5, x 2 = 45 is established. However, the input 
X| has an allowable range from a minimum value 3,0 to a maximum value 5.5, and the input x 2 has an allowable range 
from a minimum value 10 to a maximum value 45/' 

One vertex of the hyperbox Q is represented by the minimum value of x 1 and the minimum value of x 2 (this point is 
indicated by MIN in Fig. 16), and the diagonally opposite vertex is represented by the maximum value of x 1 and the 
20 maximum value of x 2 (this point is indicated by MAX in Fig. 1 6), 

it may readily be understood that the strategy case G+ 1 is expressed as follows: 
do u ^ 100 



when x 1 = 3.5, x 2 = 47 

25 until x-j -1.0, x 2 -34 

mfn x 1 = 1 .0, x 2 - 34 

max x-j - 3.5, x 2 - 48 



In the example of the crane control described above, the graph of the action samples in the input space is as shown 
30 in Fig. 17. For the sake of simplicity, Fig. 1 7 illustrates a two-dimensional input-variable space in which the orthogonal 
axes are position x and sway angle 6. (This should of course be written more precisely as three-dimensional input-var- 
iable space in which the orthogonal axes are x, 6 and de,) 

Strategy cases obtained by referring to the strategy samples shown in Fig. 13 are illustrated in Figs. 18a, 18b and 
19. The strategy cases in Figs. 18a, 18b are represented using numerical values expressed as decimal numbers (see 
35 Fig. 9), and the strategy cases in Fig, 19 are obtained by converting these numerical values to binary strings (see Fig, 
10). 

In Figs. 1 8a, 1 8b, "do:" is followed by the output variable value. Further, the input variable values (the input variable 
values of the "when" item) are arrayed following "when:" in the order of the input variables x r 6, d0. Similarly, "until:", 
"mini" and "max:" are each followed by the input variable values (the input variable values of the "until" item, "min" item 

40 and "max" item) in the order x, e r de. 

In Fig. 19, the binary codes are represented by eight bits for each variable value. The sequence is as follows: the 
input variable values of x F 9, de in the "when" item, the input variable values of x, 8, de in the "until" item, the input var- 
iable values of x, 0, de in the "rnin" item, the input variable values of x, 6, de in the "max" item and finally the output var- 
iable value of u in the "do" item. 

4$ In Fig. 17, the strategy cases 1 r 2, 3, 4 r 7 and 8 are not rectangles (hyperboxes) but are represented by single 
points (these points can also be considered as being covered by the concept of the hyperbox). 

Step 107 

so The strategy cases are optimized by neglecting (deleting)- unnecessary strategy cases in the total of strategy cases 
obtained by the above-described processing. 

First, it is determined whether two or more strategy cases have the same output variable value. In a case where 
two or more strategy cases having the same output variable value have been found, the nearness of these cases is cal- 
culated based upon the input variable values (the hyperboxes). In a case where the cases are near to each other, one 
55 is left and the others are discarded or, alternatively, the strategy cases are combined- 

For example, in a case where the hyperboxes relating to two or more strategy cases overlap perfectly, all but one 
are discarded. 

Furthermore, in a case where the distance between two or more hyperboxes is calculated and the distance 
obtained is less than a predetermined threshold value, the strategy cases are combined (or merged) or one or more is 
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discarded. For example, with regard to a point in the input variable space represented by the input variable values writ- 
ten following the "when" (or "until") item (or with regard to a MINI point or MAX point), the distance between these points 
in the two cases is used as the above-mentioned distance- 
There are many approaches available as methods of making the number of strategy cases as small as possible, 
s in the example of the crane control system, cases in which the values of the output variable u are equal are numer- 

ous. However, since the points represented by the input variable values of these cases are fairly distant all of the strat- 
egy cases are construed as being necessary and are left. 

Step 108 

10 

The goal cases and strategy cases thus created are associated in the following manner: 
Sub-step 108-1 

is With regard to the first strategy case (see Fig. 18a or Fig. 19), the status quantities of the case are compared with 
the goal condition of each goal casa The status quantities mentioned here are the input variable values following the 
"when** item (or the input variable values following the "until" item). 

Sub-step 108-2 

20 

in a case where a status quantity does not satisfy the goal condition, the first strategy case is associated with this 
goal case. If a status quantity does satisfy the goal condition, no association is made. The reason for this is that when 
a goal condition has been satisfied, a strategy leading to the goal is no longer necessary. 

25 Sub-step 1 08*3 

The processing of sub-steps 108-1 and 108-2 is repeated for all strategy cases. 

This linking together of goaf cases and strategy cases will now be performed in concrete form in the example of the 
crane control system taking an input variable value of the "when" item as the status quantity. 
so Refer to the goal cases shown in Fig. 12 and the strategy cases depicted in Figs. 18a, 18b. 

In Strategy Case No. 1 , the value of the input variable x written following the "when" item is 0.000. This value does 
not satisfy the goaf condition (value of "terminal state" ~ 9.467) of Goal No, 1 . Accordingly Strategy Case No. 1 is asso- 
ciated with Goal Case No. 1. 

The value of the InpuX variable 6 written following the "when" item of Strategy Case No. 1 is 0.000. This value sat- 
35 isfies the goal condition [range of "transitional state" » {-0.029, 0.027)] of Goal Case No. 2. Accordingly Strategy Case 
No. 1 is not associated with Goal Case No. 2, 

Similarly, the value of the input variable de written following the "when" item of Strategy Case No. 1 is 0.000. This 
value satisfies the goal condition [range of "transitional state" = (-0.039, 0.042)] of Goal Case No. 3. Accordingly, Strat- 
egy Case No. 1 is not associated with Goal Case No. 3. 
40 in the end r Strategy Case No. 1 is linked only to Goal Case No. 1. 

The f of lowing results are obtained in a simitar manner when the sub-steps 108-1 , 1 08-2 are applied to the Strategy 
Cases Nos. 2 ~ 10: 

Strategy Case Nos. 2, 3, 4, 5, 6, 8, 9 and 10 are linked only to Goal Case No. 1 ; and 
45 Strategy Case No. 7 is linked to Goal Case Nos. 1 and 7. 

Conversely the following results are obtained from the viewpoint of the goal cases: 

Goal Case No. 1 is linked to Strategy Case Nos< 1, 2, 3, 4, 5, 6, 7, 8, 9 and 10; 
so Goal Case No. 2 is not linked to any strategy case; and 
Goal Case No. 3 is linked only to Strategy Case No. 7. 

In order to store these associations in memory each strategy case has pointers pointing to all goal cases that are 
associated with it. (These pointers are stored to correspond to the data representing each strategy case.) Further, each 
55 goat case has pointers pointing to ail strategy cases associated with it. 

Fig. 20 illustrates the pointers provided for the goal cases. Here PL represents a pointer to a strategy case or cases, 
and the numbers indicated by the arrow following PL are the numbers of the strategy cases linked to the particular goal 
case. 

Fig. 21 illustrates the pointers provided in correspondence with the strategy cases. A pointer PH is a pointer to a 
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goal case or cases, and a pointer PL is a pointer to an action case or cases (described next). A number indicated by 
the arrow following PH is the number of the goal case linked to the particular strategy case. A number indicated by the 
arrow following PL is the number of the action case linked to the particular strategy case. 

Such linking of strategy cases and goal cases is displayed or printed as necessary H required, the user can add 
5 new strategy cases, delete unnecessary strategy cases or modify existing strategy cases. 

Step 109 

Strategy cases and action cases are Stnked in the manner set forth below. The term "action case" signifies a set of 
10 input variable values and output variable values at each point in time and has substantially the same definition as 
"action sample". However, action samples having exactly the same values are excluded from the action cases. 

Sub-step 109-1 

15 With regard to the first action case (see Fig. 9 or 10), the status quantity of this action case is compared with the 
strategy condition of each stfategy case. Here the term "status quantity" signifies the set of input variable values in the 
action case. The term "strategy condition" signifies a range from the set of input variable values written following the 
"when" item in a strategy case to the set of input variable values written following the item "until" in the strategy case. 
(This range will be referred to as the "when ~ until range" below.) 

20 

Sub-step 109-2 

In a case where a status quantity satisfies a strategy condition, the first action case is linked to this strategy case. 
25 Sub-step 109-3 

The processing of sub-steps 109-1 and 109-2 is repeated for all action cases. 

Such linking will now be attempted in regard to the crane control system described above, with reference being had 
to Figs. 9, 1 8a and 1 8b. The action samples shown in Fig. 9 are action cases. However, since the first action case rep- 
30 resents the quiescent state of the trolley, this case is meaningless in terms of model building and problem solving and 
no particular linkage is necessary. Accordingly, linking is performed starting from the second action case. 

In the second action case (No, 2), the status quantity is x *= 0.000, 0 ~ 0.000, de - 0.000. This falls within the "when 
~ until range" (x = 0,000 ~ 0,000, 9 * 0.000 ~ 0.000, de *= 0.000 ~ 0.000) of Strategy Case No. 1 . The status quantity 
of Action Case No. 2 does not fall within the "when ~ until range" of the other strategy cases. Accordingly, Action Case 
35 No. 2 is linked onfy to Strategy Case No. 1 . 

When this linking processing (sub-steps 1 09-1 , 1 09-2) is performed in similar fashion with regard to the other action 
cases, the associations shown in Fig. 22 are obtained. 

As shown in Fig. 22, for every action case number a strategy case linked to this action case is indicated by the 
pointer PH. The pointer PH points from the action case to the associated strategy case. The associated strategy case 
40 number is written immediately following the arrow of the pointer PH. 

Each strategy case also is provided with a pointer to the action case or cases linked to it. This is indicated by PL in 
Fig, 21. The number or numbers indicated by the arrow of the pointer PL provided lor each strategy case are the 
number or numbers of the action case or cases associated with this strategy case. 

It will be appreciated that a model composed of goal cases, strategy cases and action cases has a hierarchy struc- 
45 ture, as shown in Fig. 23, and that the goal cases, strategy cases and action cases are mutually associated. The goals 
(goal cases) occupy the highest ievel of the hierarchy (a conceptual decision level), the strategies (strategy cases) the 
intermediate level (a situational decision level) and the actions (action cases) occupy the lowermost level (an experi- 
ence decision level). 

so Step 110 

The model thus built is subjected to verification if necessary. 

In order to perform verification, the abovemerrttoned action samples (teaching data; see Fig. 9 or 1 0) are used. By 
using the \npu\ data of the action samples, output Is calculated in accordance with the previously constructed mode! just 
55 as in the procedure employed in a problem-solving mode, described later. (However, acceptance of input variable val- 
ues and encoding at a step 121 are not necessarily required.) The output data obtained is compared with the output 
data in the action sample. If both items of data are equal values or nearly equal values (e.g., if the difference between 
the two items of data is less than a predetermined threshold), then the model is construed as being one created cor- 
rectly. 
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If necessary, input data may be generated randomly and the input data may be applied to the system. The system 
calculates output data in accordance with the model and delivers this outpui data. Whether or not the output data is 
appropriate is judged by the user. 

s Step 111 

Finally, the data representing the mode! constructed is saved by being transferred from the work area of the mem- 
ory to a file area. 

10 (4) Processing in the probi em-solving mode 

In the problem-solving mode, the system actually controls (runs or manipulates) the controlled object using the 
above-d escribed model (see Fig. 2). The processing procedure is illustrated in Fig. 24. 

15 Step 121 

Input variable values from the sensors 1 1 , 12 r 1k (Fig, 2) or from rotational transducers representing the position x 
and angle sensors representing the angle of sway (Figs. 4 and 5) are accepted by the system 1 . These input values are 
converted to binary data at a suitable resolution, 

20 

Step 122 

The input data obtained is compared with the goal conditions oi the goaf cases contained in the model. If there are 
goal cases for which the goal conditions are not satisfied by the input data, then the strategy cases associated with 
2$ these goal cases are listed. 

In this step the system performs decision making at the conceptual level by deciding appropriate goal cases. 
Appropriate strategy cases are decided as a result 

Step 123 

30 

The input data is compared with the strategy cases selected at step 122. If there is a strategy case having a strat- 
egy condition (the "when - until range") which matches the input data, then the action cases associated with this strat- 
egy case are listed. 

At this stage the system is attempting to perform situational decision making by deciding appropriate strategy 
35 cases. As a result, appropriate action cases are decided. 

Step 124 

From among the action cases selected at step 123, that having input variable values that match perfectly with the 
40 input data or that having input variable values closest to the input data (i.e., that having the shortest distance in the input 
variable space) is selected as a candidate. 

Step 125 

45 In general, one candidate is selected at step 124 and therefore the output variable value of the action case which 
is the candidate is delivered as the output data. In a case where two or more candidates exist, the best one is selected 
and the output variable value of this action case is delivered as the output data. For example, when the hyperboxes of 
two strategy cases partially overlap and input data is contained in the overlapping space, different output variable val- 
ues are deduced from the two strategy cases. In this case, The output variable value selected is that obtained from the 

50 hyperbox for which the distance between the input data and the hyperbox vertex (the MIN or MAX point) is the shorter, 
by way of example. Alternatively, the output value initially obtained in processing is selected unconditionally 

Fig. 25 illustrates input data obtained when the crane control system is automatically operated in accordance with 
the model, output data (used in crane control) computed in accordance with the model and the numbers of the cases 
selected in the course of obtaining the output values. The three numerals (connected by the arrows} within the paren- 

55 theses on the rightmost side indicate the goal case number, the strategy case number and the action case number, 
respectively, from left to right. 

For example, the input data of Input/Output Data No. 1 is x » 0.000, e a 0.000, de a 0,000. These items of input 
data do not satisfy the goal condition of Goal Case No. 1 but they do satisfy the goal conditions of the other goal cases 
2, 3 (see Fig. 12). Accordingly, the ten strategy cases 1, 2, 3, 4, 5, 6, 7, 8, 9 and 10 associated with goal case 1 are 
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selected (see Fig. 20), (This is step 122.) 

Among these strategy cases 1—10, that having a strategy condition which satisfies the above-mentioned input 
data is soleiy Strategy Case No. 1 (see Fig. 18a). Accordingly, Action Case No. 2 linked to Strategy Case No. 1 is 
selected (see Fig. 21). (This is step 123,) 

s Accordingly, the output variable value u = 40.00 of Action Case No. 2 is delivered as ihe output data (see Fig. 9), 

(This corresponds to steps 124, 125). 

Not only does the input data x at the ^nput/Ouipui Data Nos, 8 ~~ 1 0 in Fig. 25 not satisfy the condition of Goal Case 
No, 1 but neither does the input data e satisfy the goal condition of Goal Case No. 2. Accordingly, Goal Cases Nos. 1 
and 2 are selected. However, a strategy case associated with Goal Case No. 2 does not exist. From the strategy cases 

w 1-10 associated with Goal Case No. 1 , therefore, a strategy case which is matched by the input data is selected. Strat- 
egy cases which have strategy conditions that satisfy all of the input data x, e, de of the Input/Output Data Nos, 8-10 
do not exist In a case such as this, the strategy case selected is that nearest to the input data (e.g., a strategy case for 
which the input data regarding some of the input variables satisfies the strategy condition, or a strategy case having a 
hyperbox at a distance closest to the input data). Strategy Case No. 5 is selected, 

15 Among the action cases associated with Strategy Case No. 6 (see Fig. 21), action cases 9 t 10, 1 1 having input var- 
iable values closest to the input data of each of the Input/Output Data Nos. 8 t 9, 10, respectively, are selected, and the 
output variable value (u = 40,000) of these action cases is delivered as the output data, 

■ The processing of steps 1 21 ~ 1 25 in Fig. 24 is repeated whenever input data is accepted from the controlled object 
(Le, r at fixed time intervals,) 

20 As many apparently widely different embodiments of the present invention can be made without departing from the 
spirit and scope thereof, it is to be understood that the invention is not limited to the specific embodiments thereof 
except as defined in the appended claims. 

Ciaims 

25 

1 . A reasoning method comprising the steps of: 

(a) accepting a series of action samples each comprising a set of input and output variable values obtained by 
operating or controlling an object; 
30 (b) creating, for each input variable, a goal case stipulated using an initial value, a terminal value, a maximum 

value and a minimum value of the input variable value in the accepted action samples; 
(c) extracting zones of input variables in which the output variable value in the accepted action samples 
remains fixed, and creating a strategy case, with regard to each zone, which links the zone of input variables 
to the output variable value; 

3S (d) associating a strategy case, which possesses a zone of input variables that does not satisfy a condition of 

a goal case, with said goal case; and 

(e) associating an action case, which possesses input variable values that fail within the zone of input variables 
in a strategy case, with said strategy case; 

40 whereby a mode! of a hierarchy structure constructed from mutually associated goal cases, strategy cases 

and action cases is built. 

2, The method according to claim 1, further comprising a step of converting the accepted action samples to binary 
data of prescribed bits. 

45 

3. The method according to claim 1 , wherein said step (b) includes the steps of: 

creating a provisional goal case; 
outputting the provisional goal case created; and 
50 adopting a goal case, after it has been evaluated by a user, as a final goal case. 

4, The method according to claim 3, wherein the evaluation performed by the user includes adding, deleting or mod- 
ifying goal cases. 

as 5* The method according to claim 1 , wherein said step (c) includes the steps of: 

(c-1) creating strategy samples by successively comparing neighboring output variable values arrayed m a 
time sequence, and storing, when a change has occurred, the output variable values immediately before and 
immediately after the change; 
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(o2) converting said strategy samples to strategy cases, which possess a "do • • • when - * * until - * * min 
- • • max'* format, by referring to said action samples; and 
(c-3) optimizing said strategy cases after conversion. 

5 6, The method according to claim 5, wherein said step (c-3) comprises reducing the number of two or more strategy 
.cases having identical output variable values. 

7. The method according to claim 1 , wherein linking of strategy cases and goal cases is implemented by pointers to 
the goal cases provided for each of the strategy cases and pointers to the strategy cases provided for each of the 

10 goal cases. 

8, The method according to claim 1 , wherein linking of action cases and strategy cases is implemented by pointers to 
the strategy cases provided for each of the action cases and pointers to the action cases provided for each of tie 
strategy cases. 

15 

9, The method according to claim 1 , wherein an action case is created by leaving one and deleting the others of action 
samples comprising sets of all identical input and output variable values. 

10, The method according to claim 1 1 further comprising the steps of: 

20 

(f) accepting said series of action samples one at a time in the time sequence thereof using said action sam- 
ples as data for verifying said model; 

(g) finding goal cases having conditions not satisfied by the input variable values of an accepted action sample 
and listing strategy cases associated with the goal cases found; 

2$ (h) finding, from among the strategy cases listed, a strategy case having a condition satisfied by the accepted 

input variable values and listing action cases associated with the strategy case found; and 
(i) from among the action cases listed, outputting an output variable value of an action case having input vari- 
able values that agree with or are nearest to the accepted input variable values; 

30 whereby said model is verified. 

11. A control method of operating or controlling an object using a model created by the reasoning method defined in 
claim 1 f comprising the steps of: 

35 successively accepting input data obtained from the object; 

finding goal cases having conditions not satisfied by the accepted input data and listing strategy cases associ- 
ated with the goal cases found; 

finding, from among the strategy cases listed, a strategy case having a condition satisfied by the accepted 
input data and listing action cases associated with fie strategy case found; 
40 from among the action cases listed, outputting an output variable value of an action case having input variable 

values that agree with or are nearest to the accepted input data; and 
repeating the foregoing processing whenever input data is accepted. 

12. A reasoning method comprising the steps of: 

45 

accepting a series of action data each comprising a set of input and output variable values obtained by oper- 
ating or controlling an object; 

creating, for each input variable, in view of the input variable values in the accepted action data, a goal of said 
operation or control; 

so creating, for each output variable, in view of the output variable values in the accepted action data, a strategy 

for attaining said goal; 

associating the goals with strategies which have not yet attained the goals; and 
associating the strategies with action data which implement them; 

55 whereby a model of a hierarchy structure constructed from mutually associated goals, strategies and actions 

is built. 

13* A reasoning system comprising: 
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(a) means for accepting and storing a series of action samples each comprising a set of input and output var- 
iable values obtained by operating or controlling an object; 

(b) means for creating, for each input variable, a goal case stipulated using an initial value, a terminal value, a 
maximum value and a minimum value of the input variable value in the accepted action samples; 

5 (c) means for extracting zones of input variables in which output variable value in the accepted action samples 

remains fixed, and a creating strategy cases, with regard to each zone, which links the zone of input variables 
to the output variable value; 

(d) means for associating a strategy case, which possesses a zone of input variables that does not satisfy a 
condition of the goal case, with the goal case; and 
*0 (e) means for associating an action case, which possesses input variable values that fall within the zone of 

input variables in a strategy case, with said strategy case; 

whereby a model of a hierarchy structure constructed from mutually associated goal cases, strategy cases 
and action cases is built. 

15 

1 4. The system according to claim 1 3, wherein said means (a) includes means for converting the accepted action sam- 
ples to binary data of prescribed bits. 

15. The system according to claim 13, wherein said means (b) includes: 

20 

means for creating a provisional goal case; 

means for outputting the provisional goal case created; and 

means for adopting and storing a goal case, after it has been evaluated by a user, as a final goal case. 

25 16. The system according to claim 1 5, wherein the evaluation performed by the user includes adding, deleting or mod- 
ifying goal cases, 

17, The system according to claim 13, wherein said means (c) includes: 

30 (c-1) means for creating strategy samples by successively comparing neighboring output variable values 

arrayed in a time sequence, and storing, when a change has occurred, the output variable values immediately 
before and immediately after the change; 

(c-2) means for converting said strategy samples to strategy cases, which possess a "do • * * when * * - until 
• * * min * * - max" format, by referring to said action samples; and 
35 (c-3) means for optimizing said strategy cases after conversion, 

18. The system according to claim 17, wherein said means (c-3) comprises reducing the number of two or more strat- 
egy cases having identical output variable values. 

40 1 9. The system according to claim 13, wherein said means (d) implements linking of strategy cases and goal cases by 
pointers to the goal cases provided for each of the strategy cases and pointers to the- strategy cases provided for 
each of the goal cases, 

20* The system according to claim 13, wherein said means (e) implements linking of action cases and strategy cases 
45 by pointers to the strategy cases provided for each of the action cases and pointers to the action cases provided 
for each of the strategy cases. 

21. The system according to claim 13, wherein an action case is created by leaving one and deleting the others of 
action samples comprising sets of all identical input and output variable values. 

50 

22. The system according to claim 13, further comprising: 

(f) means for accepting said series of action samples one at a time in the time sequence thereof in order to use 
said action samples as data for verification purposes; 
55 (g) means for finding goal cases having conditions not satisfied by the input variable values of an accepted 

action sample and listing strategy cases associated with the goal cases found; 

(h) means for finding, from among the strategy cases listed, a strategy case having a condition satisfied by the 

accepted input variable values and listing action cases associated with the strategy case found; and 

(t) means for outputting, from among the action cases listed, an output variable value of an action case having 
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input variable values that agree with or are nearest to the accepted input variable values; 
whereby said model is verified. 

5 23. A control system for operating or controlling an object using a model created by the reasoning system defined in 
claim 13, comprising: 

means for successively accepting input data obtained from the object; 

means for finding goal cases having conditions not satisfied by the accepted input data and listing, in memory, 

10 strategy cases associated with the goal cases found; 

means for finding, from among the strategy cases listed, a strategy case having a condition satisfied by the 
accepted input data and listing , in memory action cases associated with the strategy case found; 
means for ouiputting, from among the action cases fisted, an output variable value of an action case having 
input variable values that agree with or are nearest to the accepted input data; and 

15 means for performing control so as to repeat the foregoing processing whenever input data is accepted. 
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Fig . 25 
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